Picture for Shaofeng Zhang

Shaofeng Zhang

Moment-Video: Diagnosing Temporal Fidelity of Video MLLMs on Momentary Visual Events

Add code
Jun 01, 2026
Viaarxiv icon

NITP: Next Implicit Token Prediction for LLM Pre-training

Add code
May 24, 2026
Viaarxiv icon

Resolving Representation Ambiguity in Feedforward Novel View Synthesis Transformer via Semantic-Spatial Decoupling

Add code
May 18, 2026
Viaarxiv icon

SWIFT: Prompt-Adaptive Memory for Efficient Interactive Long Video Generation

Add code
May 10, 2026
Viaarxiv icon

Prompt-Free Universal Region Proposal Network

Add code
Mar 18, 2026
Viaarxiv icon

GRADE: Benchmarking Discipline-Informed Reasoning in Image Editing

Add code
Mar 12, 2026
Viaarxiv icon

EvoTok: A Unified Image Tokenizer via Residual Latent Evolution for Visual Understanding and Generation

Add code
Mar 12, 2026
Viaarxiv icon

PointAlign: Feature-Level Alignment Regularization for 3D Vision-Language Models

Add code
Feb 28, 2026
Viaarxiv icon

DreamWorld: Unified World Modeling in Video Generation

Add code
Feb 28, 2026
Viaarxiv icon

Pathwise Test-Time Correction for Autoregressive Long Video Generation

Add code
Feb 05, 2026
Viaarxiv icon